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Abstract. The Radial Velocity Experiment (RAVE) is a large wide- 
held spectroscopic stellar survey of the Milky Way. Over the period 
2003-2013, 574,630 spectra for 483,330 stars have been amassed at a 
resolution of R=7500 in the Ca-triplet region of 8410-8795A. Wave¬ 
length coverage and resolution are thus comparable to that anticipated 
from the Gaia RVS. Derived data products of RAVE include radial ve¬ 
locities, stellar parameters, chemicals abundances for Mg, Al, Si, Ca, 
Ti, Fe, and Ni, and absorption measures based on the diffuse interstel¬ 
lar bands (DIB) at 8620A. Since more than 290000 RAVE targets are 
drawn from the Tycho-2 catalogue, RAVE will be an interesting pro¬ 
totype for the anticipated full Gaia data releases, in particular when 
combined with the early Gaia data releases, which contain astrometry 
but not yet stellar parameters and abundances. 


1 Introduction 

Despite the fact that the Galaxy is one unique system, understanding its formation 
holds important clues to study the broader context of galaxy formation. Wide field 
spectroscopic surveys play a particularly important role in analysis of the Milky 
Way: Spectroscopy enables us to measure the radial velocity, which in turn allows 
us to study the details of Galactic dynamics. Spectroscopy also permits to measure 
the abundance of chemical elements in a stellar atmosphere which holds important 
clues on the initial chemical composition and its subsequent metal enrichment. 
Despite this importance, ten years ago wide-held spectroscopic surveys of the 
Milky Way was still limited to the Geneva Copenhagen survey (CGS, Nordstrom 
et al, 2004), which only covered a sphere of about 100 pc radius around the sun 
(the so-called Hipparcos sphere). 

The situation has fundamentally changed over the past decade, with several 
wide-held spectroscopic surveys underway: SDSS-SEGUE and RAVE being com- 
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pleted in terms of data taking, LAMOST, APOGEE and HERMES well underway, 
and some massive campaigns such as 4MOST, WEAVE and DESI in the making, 
each of the latter delivering spectroscopic data for some 10 million stars. The Gaia 
mission will also not only provide exquisit distances and proper motions for up to 

1 billion stars, it also will deliver spectra for some 100 million stars. The RAVE 
survey can play a particular role in the preparation of the Gaia era, as the spectral 
range, signal to noise and spectral resolution of RAVE spectra is similar to those 
expected from the Gaia- RV10 

2 RAVE survey description 

RAVI0 began observations in 2003, and since then has released four data releases 
(noted DR hereafter): DR1 in 2006 (Steinmetz et al. 2006), DR2 in 2008 (Zwitter 
et al. 2008), DR3 in 2011 (Siebert et al. 2011), and DR4 in 2013 (Kordopatis 
et al. 2013a). RAVE is a magnitude-limited survey of stars randomly selected 
in the southern celestial hemisphere. The original design was to only observe 
stars in the interval 9 < I < 12 but owing to the IR based selection function, 
some stars that are brighter and fainter can be found. The spectra are obtained 
from the 6dF facility on the 1.2m Australian Astronomical Observatory’s UK- 
Schmidt telescope in Siding Spring, Australia, where three field plates with up 
to 150 robotically positioned fibres are used in turn. The effective resolution of 
RAVE is R = A/AA ~ 7500 and the wavelength range coverage is around the 
infrared ionised Calcium triplet (IR Call, AA8410 — 8795A), one of the widely used 
wavelength ranges for Galactic archaeology. It is also the wavelength range in 
which the Gaia RVS is operating. RAVE overall has amassed 574 630 spectra for 
483 330 unique stars, 425 561 of these targets are included in DR4. 

DR4’s completeness with respect to 2MASS is shown for various magnitude 
bins in the Aitoff projections shown in Figure [T] Each 6dF field set-up only 
selects targets from one of the following four magnitude bins: 9.0 < I < 10.0 
(8.0 < / < 10.0 for the new 2MASS input catalogue), 10.0 < / < 10.8, 10.8 < 
I < 11.3 and 11.3 < I < 12 mag. This minimizes the magnitude range within 
any one 6dF field set-up to be within a bin, meaning exposure times can be scaled 
more appropriately for all the targets in the same field. Each field set-up is a 
random selection of unobserved targets within these bins (apart from designed 
repeat observations). Any spectrum within a 6dF field set-up can be adjacent 
to any other in the same set-up on the CCD but the bins limit the magnitude 
difference, which also minimizes fibre cross-talk. Figure [I] also shows that while 
for the brighter magnitudes bins, high rates of completeness have been achieved 
(with some individual fields reaching close to 100% compared to 2MASS), RAVE 
has not exhausted the input catalogue for fainter targets, in particular for I > 11.3. 


1 For visualization of the volume covered by some of these surveys, see http://www.rave- 
survey. org/project/gallery/ movies / 
z http: //www.rave-survey.org 
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Fig. 1. Aitoff projection (Galactic coordinates) of the completeness of the stars in the 
-Umass band for which radial velocity measurements are available. Each panel shows a 
different magnitude bin. Grey-scale coding represents the ratio of RAVE observations to 
2MASS stars, (from DR4) 


3 RAVE data products 

3.1 Radial velocities 

Radial velocities (RV) are obtained using a standard cross-correlation in Fourier 
space on the continuum subtracted spectra (see DR3 for details) in a two-step 
procedure: First an estimate of the RV is determined using a subset of 10 template 
spectra. This first estimate gives an RV estimate with an accuracy better than 
5 km s -1 and is used to shift the spectrum to the zero velocity frame. Using 
the full template database then a new template is constructed using a penalized 
chi-square technique (see DR2), which then in turn is used to derive a the more 
precise RV published in the data base. The histograms of the internal error of the 
RVs in DR1, DR2, DR3 and DR4 is shown in Fig. [2| It can be seen that 68% 
of the targets have an internal RV error better than 1.4 km s -1 in DR4 (see left 
frame of Figure 2). Error estimates derived from repeat observations and from 
comparison with external reference stars give values consistent with the numbers 
quoted above. 

3.2 Atmospheric parameters 

The wavelength region AA8410 —8795A is very attractive for Galactic archaeology 
purposes, as it features relatively few telluric absorptions, but exhibits many iron 
and a-element lines, in particular the prominent Ca triplet, which allows rela- 
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Fig. 2. Top left: The histograms of the internal radial velocity error for data new 
to each data release. The bin size is 0.2 km s -1 . For DR4, the number of stars per 
bin is divided by 15 to compensate for the increase in sample size. Bottom left: The 
cumulative distributions, where the dotted lines mark 50, 68 and 95% of the samples. 
Middle: Comparison of the reference values found in the literature and the derived 
effective temperatures. Dotted diagonal lines represent offsets from unity of ±300 K. 
The mean offsets (g) and the dispersions (cr) are indicated in the upper left corner of 
each plot. Right: Same for log g, dotted diagonal lines represent offsets from unity of 
±0.5 dex. 


tively easily radial velocity measurements and metallicity estimations for any type 
of spectrum. However, spectra with a resolution R < 10 000 suffer from spectral 
degeneracies that, if not appreciated, can introduce serious biases in spectroscopic 
surveys that use automated parameterization pipelines. These degeneracies are 
mostly important for cool main-sequence stars and stars along the giant branch 
(see Kordopatis et al. 2011 for a discussion). In matching observed spectra to 
templates in a library, decision-tree methods have been shown to perform better 
compared to other algorithms, like the projection methods (e.g.: principal compo¬ 
nent analysis) or the ones trying to solve an optimization problem (e.g.: minimum 
y 2 ), in particular when the SNR is low. The RAVE DR4 pipeline therefore em¬ 
ploys a hybrid of a decision-tree algorithm called DEGAS (Bijaoui et al. 2012) and 
a projection method called MATISSE (Recio-Blanco et al. 2006) which improves 
the interpolation between the grid points. A comparison with a set of reference 
stars (see middle and right frame of Figure 2) shows a satisfactory dispersion of 
about 400K for the derived T e ff (considerable less for a high S/N subsample), while 
the log g determination is still suffering from the aforementioned degeneracies in 
the Ca triplet region (see table 1 and 2 in DR4). 

3.3 Chemical abundances 

Abundances for individual chemical elements are determined for the elements Mg, 
Al, Si, Ti, Ni and Fe based on a curve of growth analysis using the atmospheric 
parameters of the previous section as input values for T e ff and log g (Boeclie et 
al. 2011). The chemical pipeline relies on an equivalent widths (EWs) library 
which contains the expected EWs of the lines visible in the RAVE wavelength 
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Fig. 3. Median azimuthal velocities, v^,, and metallicities, [M/H], for all the RAVE stars 
for which distance and velocity determinations were available. The bins in R and Z are 
0.25 kpc and contain at least 50 stars each. The black ” + ” sign is at ( Rq = 8 kpc, 
Zq = 0 kpc), the assumed position of the Sun (from Kordopatis et al. 2013b). 


range (604 atomic and molecule lines). These EWs are computed for a grid of 
stellar parameters values covering the range [4000,7000] K in T e ff, [0.0,5.0] dex in 
log g and [—2.5,+0.5] dex in [M/H] and five levels of abundances in the range 
[—0.4, +0.4] dex. The chemical pipeline constructs on-the-fly spectrum models by 
adopting the effective temperatures and surface gravities obtained by the DR4 
atmospheric parameters pipeline. It then searches for the best fitting model by 
minimizing the x 2 between the models and the observational data. The estimated 
errors in abundance, based on a comparison with reference stars, depend on the 
element and range from 0.17 dex for Mg, Al and Ti to 0.3 dex for Ti and Ni. The 
error for Fe is estimated as 0.23 dex. 


3.4 Distances 

Distances are estimated for spectra with SNR> 10 pixeR 1 by projecting the atmo¬ 
spheric parameters onto a the Padova set of theoretical isochrones and obtaining 
the most likely value of the absolute magnitude of the stars. DR4 employs an 
algorithm based on the Bayesian distance-finding method presented in Binney et 
al. (2014), and takes into account the interstellar extinction, as well as kinematic 
correction factors obtained by the method of Schonrich et al. (2012). Testing using 
Hipparcos stars indicates that the inverse of the expectation value of the parallax 
is the most reliable distance estimator. Compared to a subset of Hipparcos stars, 
the method results in an over-estimation of less than 10% for the dwarfs and less 
than 20% for the giants. The method has also been tested on the open cluster 
spectra, delivering very satisfactory distances, provided a cluster-specific age prior 
is used. 
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4 RAVE compared to the Gaia RVS 

Based on the currently available proper motions (UCAC4), radial velocities derived 
from RAVE spectra and distances derived from matching RAVE stellar parame¬ 
ters with isochrones, for 80% of the stars in the RAVE volume, 3D velocities are 
known to better than 20 km s _1 . Combined with the large number of targets, 
this allows already detailed investigations on the structure and kinematics of the 
Milky Way disk(s) (see Figure 3). Recent examples include: the detection of a 
wave-like pattern in the stellar velocity distribution (Williams et al. 2013); an 
improved determination of the total mass of the Milky Way (Piffl et al. 2014a) 
and of the local dark matter density (Bienayme et al. 2014, Piffl et al. 2014b); 
the identification of chemo-dynamical signatures with respect to a merger origin 
of the Galactic disk (Minchev et al. 2014); the identification of stars in the local 
neighborhood with supersolar metallicity, which likely have formed closer to the 
Galactic center and then moved outwards by radial migration (Kordopatis et al. 
2015); the identification of stars tidally stripped of the globular clusters M22, NGC 
1851 and NGC3201 (Kunder et al. 2014, Anguiano et al. 2015); the creation of 
pseudo-3D maps of the diffuse interstellar band at 8620A (Kos et al. 2014); and 
the detection of a metal-rich high-velocity star born in the Galactic disk (Hawkins 
et al. 2014). 

The available astrometry of Milky Way stars will dramatically improve already 
with the first Gaia data releases (Prusti et al. 2012), bringing the accuracy of the 
3D velocities down to better than 5 km s'"' 1 , and at the same time considerably in¬ 
creasing the probed volume. Regarding the stellar parameters derived from Gaia 
spectroscopy, changes in terms of accuracy will be more gradual, as the wave¬ 
length range of Gaia-RVS and RAVE is almost identical, though the resolution 
of the Gaia-RVS is somewhat higher (10 500 for Gaia-RVS vs 7 500 for RAVE). 
However, the volume probed at similar accuracy will be 1-2 magnitudes deeper 
with much higher completeness, in particular at faint magnitudes. A particularly 
interesting data set for the next few years will be the combination of Gaia astrom¬ 
etry, as published in the early data releases, with RAVE stellar parameters, radial 
velocities and abundances, as according to the current Gaia release schedule, the 
latter properties will only be provided in the later data releases. 

5 Outlook 

The RAVE collaboration currently investigates a number of improvements for the 
next data release. Besides analyzing the as of yet not published stars, a recalibra¬ 
tion for super solar metallicities has been devised (Kordopatis et al. 2015). The 
distance pipeline has been found to systematically underestimate the distances for 
low metallicities ([Fe/H]< —1), as it is e.g. for stars stripped off globular clusters. 
A correction using lower-metallicity isochrones is underway. The release of optical 
photometry of APASS (Munari et al. 2014) will allow to determine T e ff with the 
aid of additional photometric priors, reducing the degeneracy in the log g deter¬ 
mination. Last but not least, we expect to get additional calibration data for log g 
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from Kepler-2 astroseismology. Kepler-2 campaigns 0-2 currently have some 1400 
RAVE stars on their target list. 
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